Shapley Q-Value: A Local Reward Approach to Solve Global Reward Games
Similar resources
Average Reward Timed Games
We consider real-time games where the goal consists, for each player, in maximizing the average reward he or she receives per time unit. We consider zero-sum rewards, so that a reward of +r to one player corresponds to a reward of −r to the other player. The games are played on discrete-time game structures which can be specified using a two-player version of timed automata whose locations are ...
Covariance Matrix of Multivariate Reward Processes with Nonlinear Reward Functions
Multivariate reward processes with reward functions of constant rates, defined on a semi-Markov process, were first studied by Masuda and Sumita (1991). Reward processes with nonlinear reward functions were introduced in Soltani (1996). In this work we study a multivariate process whose components are reward processes with nonlinear reward functions. The Laplace transform of the covar...
Shapley value for assignment games
We consider the problem of the axiomatization of the Shapley value on the class of assignment games. We show that Shapley’s original [21], Young’s [24], Chun’s [7], van den Brink’s [2], (5-6) Hart and Mas-Colell’s [12] potential function and consistency approaches and Roth’s [19] characterization do not work on the class of assignment games. We also consider Myerson’s [15] axiomatization of the...
Journal
Journal title: Proceedings of the AAAI Conference on Artificial Intelligence
Year: 2020
ISSN: 2374-3468,2159-5399
DOI: 10.1609/aaai.v34i05.6220